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SYSTEM AND METHOD FOR QUERY 

REFINEMENT TO ENABLE 
IMPROVED SEARCHING BASED ON 

IDENTIFYING AND UTILIZING 
POPULAR CONCEPTS RELATED TO 

0 USERS' QUERIES 

2 Background of Invention 

[0001] The invention relates generally to searching for relevant data entities based on a 
4= search query, specifically in the context of ambiguous or under-specified queries. 

1 !! 

n More particularly, the invention relates to helping users to refine their search queries 

H| by identifying search concepts related to the user's search query, providing the means 

M= for the user to use these concepts to refine their query and submit an enhanced query 

^ based on such concepts, and thus access information more specific to their needs. 

[0002] One of the greatest strengths and greatest weaknesses of the Internet is the vast 
amount of information that is distributed over all the computers connected on the 
Internet. This is one of the Internet's greatest strengths in that individuals have access 
to great amounts of information on almost any topic imaginable. However, this is also 
one of the Internet's greatest weaknesses in that, because of the vast amount of 
information, it is difficult to know what information on a desired topic is available, and 
where to go to find the information. 

[0003] Search engine technology attempts to overcome this weakness of the Internet by 
providing an indexed access to a collection of web pages that a user can search. The 
user typically enters a search query. The search engine then finds the web pages that 
contain or otherwise relate to the search query, and this list of web pages is presented 
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to the user. There are a number of different ways that search engines determine which 
web pages are relevant to a given search query, such that those web pages are 
presented to the user. 

[0004] . First, one type of search engine constantly scans the Internet, in a process 

referred to as spidering. This type of search engine has been popularized by ALTA 
VISTA and GOOGLE, among others. Each page of a web site that is visited by the 
spider is cataloged for the words that appear in the web site. This information is 
indexed and stored in a search engine database. When a user enters a search query, 
the search engine matches the query against the search engine database to find the 
web pages that are most relevant to the query by some measure. For example, the 
search engine may determine the number of times the query appears in a given web 
page to determine its relevance, or the search engine may determine the number of 
other web pages that link to the given web page in which the query appears to 
determine its relevance. 

This type of search engine is disadvantageous in that many search queries contain 
words that are related to more than what the user is searching. For example, the user 
may be looking for web pages regarding the golfer Tiger Woods . However, if the user 
just enters the word Tiger as the search query, the search engine is likely to return 
many web pages related to the animal tiger , as well as to the golfer Tiger Woods . 
Furthermore, if the user enters the words Tiger Woods , the search engine may also 
return web pages that include the words tiger and woods , but which do not 
necessarily relate to the golfer Tiger Woods . 

[0006] Another type of search engine compares a search query to web pages cataloged in 
a topical directory. This type of search engine has been popularized by YAHOO! and 
LOOKSMART. A team of people assigns web sites to one or more different categories 
within the directory. When a user enters a search query, the search engine matches 
the query against the directory of web pages, and returns both the categories and the 
individual web pages that are relevant to the query. For example, in response to a 
Tiger Woods query, the search engine may return the category Sports .Goffers .Tiger 
Woods and the category Animals. Tigers , as well as web pages that contain both the 
words tiger and woods . 
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[0007] This type of search engine also has its disadvantages. If the user enters a query 

too broad to find adequately specific and targeted results, it is often difficult to guess 
a query that would easily and accurately narrow the query to the desired area. 

[0008] Other failings are common to all of these and other types of search engines. Most 
are unforgiving as to misspelled words, or abbreviated variants for desired topics. For 
example, if the user enters in tigr woods instead of Tiger Woods , search engines are 
likely not to return many relevant pages regarding the golfer. Search engines may also 
provide results that are considered inappropriate by many users, or, in the case of 
children, their parents. For example, a user may enter in as a query the name of his or 
her favorite singer. Besides web sites geared towards providing information about the 
singer, search engines may also return X-rated sites that claim to provide 
inappropriate pictures of the singer. 

:? [0009] Another failing of existing types of search engines is that they assume a level of 

r=~= 
= ^f - 

E3 searching experience or sophistication on the part of their users that may not exist. In 

other words, the quality of search results they return frequently corresponds to how 
good the search query is that the user entered. Users who are less competent in 
formulating search queries are therefore likely to receive poorer search results from 
search engines as compared to users who are more competent in formulating queries. 
For example, less knowledgeable users may enter queries that are overly broad, or 
alternatively, overly specific. Overly broad queries are likely to generate search results 
that contain a number of irrelevant web pages, whereas overly specific queries are 
likely to generate search results that may not include a number of relevant web pages. 

[0010] For these and other reasons, therefore, there is a need for the present invention. 

Summary of Invention 

[0011] 

The invention relates generally to refining a user query. In a method of one 
embodiment, a query is received from a user, and then mapped to one or more search 
concepts. A list of search concepts associated with the query is then displayed. 
Alternatively or additionally, the search concepts associated with the query are used to 
provide a set of improved search results instead of being displayed. In a method of 
still another embodiment, a number of queries from a number of users are analyzed 
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to identify two or more search concepts, and a popularity value is assigned to them 
based on the queries. Thus, the relative popularity of the respective search concepts 
can be determined. Alternatively or additionally, a preferred search query for the 
search concepts can be determined. 

[0012] Furthermore, one specific embodiment of the invention relates to searching and 
query refinement based on matching user's queries to key phrases of concepts that 
have a popularity measured by the appearance of the concept's title and key phrases 
within the search engine's log of all queries. In particular, this embodiment relates to 
helping users to refine their search queries by identifying popular concepts related to 
the user's search query, providing the means for the user to use these concepts to 
refine their query and submit an enhanced query based on such concepts, and thus 
access information more specific to their needs. 

vP [0013] For example, a concept may be Tiger Woods . The key phrases associated with this 
m concept may include tiger , tigr f greatest golfer; tiger woods the golfer , as well as 

lif other key phrases. Another concept may be the animal tiger , with its own key 

45 phrases. Each concept may have one or more data entities (web page links, other 

•■at 

information) considered most relevant to the concept associated with it. 

Sj [001 4] To determine the popularity of each concept, the key phrases and concept words 
can be matched to a query log of past queries. The popularity of each concept is 

p based on at least the number of different query phrases within the query log that 

match the key phrases of the concept, and the number of times each of these query 
phrases appears within the query log. More particularly, a number of popularity points 
proportional to the number of times a query phase appears in the query log is added 
to the concept where the query phase matches a key phrase that is unique to the 
concept. For a key phrase that appears in more than one concept, a number of 
popularity points proportional to the number of times a query phrase appears in the 
query log that matches the key phrase is apportioned among such concepts. 

[001 5] When a user enters a query, the query is matched against the key phrases and 

titles of the concepts to yield matching concepts. Matching concepts are those having 
one or more key phrases that match the query. For example, a user entering in tiger 
may have returned to him or her two concepts, Tiger Woods , and tiger (the animal). 
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[001 6] A general popularity measure of each concept may also be determined and 

returned to the user. The popularity measure of a concept reflects its popularity within 
the query log as indicated by the number of popularity points that have been added 
for the concept. For example, in one implementation, the popularity measure can be 
determined as five times the log of the popularity points of the concept divided by the 
log of the popularity points of the most popular overall concept. This popularity 
measure returns a number from one to five indicating the relative popularity measure 
of the concept. Note that other implementations of the popularity measure may be 
devised and used. 

[001 7] A preferred search query of each concept is also determined and returned to the 
user. For example, the key phrase of a concept that is uniquely associated only with 
the concept, and that has a greatest popularity of any key phrase of the concept 
4f within the query log, may be selected as the preferred search query for that concept. 

gi The use of this preferred search. query as the user's next query is aimed at obtaining 

m 

^ the best results related to the associated concept. The popularity of a key phrase is 

O determined by the number of popularity points added to the concept as a result of the 

□ key phrase matching a query phase within the query log. The invention is not 

L. restricted to data entities of type web links, but can be used to access data or data 

S s entities of a number of different types including documents, document links, web 

pages, video files etc. Although the invention described is for data entities that are 
p web page links, this is for the purposes of example only, and does not represent a 

limitation of the invention in any way. 

[001 8] The invention overcomes some of the disadvantages of the prior art indicated in 
the background. For example, overly broad search queries entered by users can be 
refined such that the users receive links to get relevant search results while the 
number of irrelevant web pages is significantly reduced. Additionally, since the 
concepts displayed are editorially chosen, these are free of pornography or other 
undesirable material, and may safely be used as steps in refinement. 

[0019] 

The invention includes methods and computer-readable media of varying scope. 
Still other aspects, advantages, and embodiments of the invention, besides those 
described in this summary, will become apparent by reading the detailed description 
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that follows, and by referring to the drawings. 

Brief Description of Drawings 

[0020] FIG. 1 is a diagram showing the structure of a concept, along with its key phrases 
and associated web sites, according to an exemplary embodiment of the invention. 

[0021] FIG. 2 is a flowchart showing the overall methodology of an exemplary 
embodiment of the invention. 

[0022] FIG. 3 is a flowchart of a method showing the popularity calculation stage of FIG. 2 
in more detail, according to an exemplary embodiment of the invention. 

[0023] FIG. 4 is a diagram showing the structure of a query log, according to an 
exemplary embodiment of the invention. 

[0024] FIG. 5 is a diagram showing the results of matching the query log to the key 
phrases of a concept, according to an exemplary embodiment of the invention. 

[0025] FIG. 6 is a flowchart of a method showing the imaging stage of FIG. 2 in more 
detail, according to an exemplary embodiment of the invention. 

[0026] FIG. 7 is a flowchart of a method showing the query stage of FIG. 2 in more detail, 
according to an exemplary embodiment of the inyention. 

[0027] FIG. 8 is a flowchart of a method showing how the results of the query stage of 
FIG. 7 can be output, according to an exemplary embodiment of the invention. 

[0028] FIG. 9 is a diagram of an exemplary web page depicting the integration of search 
results obtained in accordance with a preferred embodiment of the invention with 
other search results. 

[0029] FIG. 1 0 is a diagram of a web page showing another way as to how the search 

results of the invention can be output, according to an exemplary embodiment of the 
invention. 

[0030] FIG. 1 1 is a diagram of a system that implements an exemplary embodiment of 
the invention. 
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[0031] FIG. 12 is a diagram of another system that implements an exemplary 
embodiment of the invention, as compared to FIG. 1 1 . 

[0032] FIG. 1 3 is a block diagram of a representative computing system environment in 

accordance with which exemplary embodiments of the invention may be implemented. 

Detailed Description 

[0033] In the following detailed description of exemplary embodiments of the invention, 
reference is made to the accompanying drawings that form a part hereof, and in which 
is shown by way of illustration specific exemplary embodiments in which the invention 
may be practiced. These embodiments are described in sufficient detail to enable 
those skilled in the art to practice the invention. Other embodiments may be utilized, 
and logical, mechanical, electrical, and other changes may be made without departing 
□ from the spirit or scope of the present invention. The following detailed description is, 

]= therefore, not to be taken in a limiting sense, and the scope of the present invention 

ffl is defined only by the appended claims. 

Q 

^ [0034] Concepts, key phrases, and associated data entities 
z [0035] 

JT 3 ; FIG. 1 shows a diagram 1 00 of a concept 1 02, its key phrases 1 04, and its 

M= associated web site 106, or other associated data entities. The concept 102 is one or 

^ more concept words that are the title of the concept that a user may wish to search 

N= for with a search engine. A concept may be a topic, for example. It is typically 

manually determined. The key phrases 1 04 for the concept 1 02 are other phrases of 
one or more words that a user may enter as a search query to find information 
regarding the concept 1 02. The key phrases 1 04 may be alternative key words for the 
concept 102, misspellings of the concept 102, shorthand notation for the concept 
1 02, or other phrases for the concept 1 02. The key phrases 1 04 are typically manually 
determined or edited, by examining, for example, query logs of past search queries to 
determine how users search for the concept 1 02. The associated web sites 1 06 are the 
data entities that are returned to the user when he or she searches for the concept 
1 02. They can be links to web sites related to the concept 1 02, or other types of data 
entities. For example, they may be music file links, image file links, or other types of 
files or links. The associated web sites 106 are typically manually determined or 
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edited as well. 

[0036] An example of a concept 1 02 may be the popular singer Britney Spears . The title 
of the concept 1 04 are the words Britney Spears . The key phrases 1 04 for this 
concept may be just the word Britney, britneyspears.com , the name of her albums, as 
well as misspellings of her name, such as Brittney, Brittany, and so on. The associated 
web sites 106 may include the official Britney Spears web site, as well as various fan 
sites, and sites at which fans can purchase her music. The web sites 1 06 may also be 
edited, to ensure that no inappropriate, adult-oriented web sites are included, so that 
the web sites 1 06 are children friendly. 

FIG. 2 is a flowchart of a method 200 showing the overall process followed by an 
exemplary embodiment of the invention. This process is described in summary, with a 
detailed description provided for each stage in subsequent sub-sections. In the 
popularity calculation stage 202, query logs that provide frequency of occurrence of 
each query on a historical basis, as well as a list of concepts and their associated key 
phrases, are used to determine the popularity of each key phrase of each concept. 
This popularity is used in turn to determine the popularity of each concept. The 
popularity of a concept is based on the queries that match the key phrases associated 
with the concept, and the occurrence of each of those queries within the query log. 

In the imaging stage 204, the output of the refinement stage is indexed, and 
added to the data image, or database, used by the search engine to answer queries 
entered by users. Both the refinement stage 202 and the imaging stage 204 occur 
before a user has entered a query. That is, they are offline stages. They can be 
performed on a periodic basis to update the popularity of concepts based on newly 
generated query logs. 

[0039] q Uer y stage 206 is performed each time a query is entered by a user, and is 

considered an online stage. A search query is a query phrase that includes one or 
more words entered by the user. The search engine searches the database for related 
concepts by textually matching the words of the query phrase against the key phrases 
of the concepts, as well as the words in the concept titles, resulting in zero or more 
matching concepts. A concept is said to match a query if the query matches a 
keyphrase and/or one or more words in the title. Note that the keyphrases and the 
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title words may be matched in different ways to the query. For example, we may use 
exact string match on the keyphrases, while we may require each of the words in the 
query to be contained (as a whole word) in the title without regard to order. The use 
of different matching schemes is not a required part of finding matching concepts. 
The returned list of matching concepts is sorted by decreasing popularity, such that 
the most popular choice is listed first. The associated data entities of these concepts 
may also be shown, or may be displayed when the user selects a concept. A visual 
measure of the relative popularity of a concept as compared to the other concepts 
may also be displayed, as well as a preferred search query for each concept. 

[0040] Refinement stage 

[0041] FIG. 3 is a flowchart showing the refinement stage 202 of FIG. 2 in more detail. 

First, the concept words of the concepts and the key phrases are matched against the 
query phrases of the query log (300). An example query log is shown FIG. 4. The 
query log 400 has a number of queries 400a, 400b, and 400m. Each query is a query 
phrase that includes one or more words. A query may appear more than once in the 
query log 400. Alternatively, each query in the query log 400 may be a unique query, 
and have associated therewith the number of times the query was entered during the 
logging period. For example, in the former case, if the query golf balls was entered 
700 times during the logging period, it would appear 700 times in the query log 400. 
In the latter case, however, the query would appear once, along with the number 700 
to indicate that it was entered 700 times during the logging period. 

[0042] The term query log is used generally. A query log may be a historical log of 

queries that have been entered by users during a logging period, and this is how it is 
predominantly used. However, a query log may also be modified to actively affect the 
popularity of concepts. For example, a query log may be populated with queries that 
were not actually entered so that the popularity of concepts are predictively modeled. 
For example, if it is known that the Super Bowl \n\\\ be a popular search query in the 
coming months, the query log may be modified to add this query a large number of 
times to the log. Other ways to affect the ensuing popularity of concepts, for other 
predictive modeling, or for demographic or for other reasons, are also encompassed 
within the term query log. 
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[0043] Referring back to FIG. 3, a popularity point is added to a concept for the number 
of times each query phrase appears in the query log that matches a key phrase unique 
to the concept (302). That is, a popularity point is added to a concept for the number 
of times each query phrase matches the concept words of a concept or a key phrase 
that is unique to the concept. For example, the query Brittany may appear 350 times 
in the query log. For the concept Britney Spears , there may be a key phrase Brittany 
that is unique to this concept, such that no other concept has this key phrase. 
Therefore, 350 popularity points are added to the concept Britney Spears \ As another 
example, the query Britney Spears may appear 200 times in the query log. Because 
this query matches the concept words of the concept Britney Spears , 200 popularity 
points are added to the concept. 

[0044] Next, for query phrases in the query log that match key phrases of more than one 
concept, a number of popularity points equal to the number of times such a query 
phrase appears in the query log is divided among such concepts (304). For example, 
the query tiger may appear 400 times in the query log. There may be two concepts 
that have the key phrase tiger , the concept Tiger Woods , and the concept wild tiger . 
Therefore, the 400 popularity points for the query tiger are. apportioned between 
these two concepts. One way to apportion the popularity points is to proportionally 
divide the points among the concepts based on their amassed popularity points 
resulting from 302. For example, the concept Tiger Woods may have 900 popularity 
points so far, and the concept wild tiger may have 1 00 popularity points so far. 
Therefore, 9096 of the 400 popularity points for the query tiger art added to the 
concept Tiger Woods , and 1 0% are added to the concept wild tiger . Other ways to 
apportion the popularity points, can also be used, however, such as equally dividing 
the points among the concepts that have such matching key phrases. 

[0045] resu | t Q f 300, 302, and 304 is that each concept has a number of popularity 

points added thereto, based on the matching of query phrases to the key phrases of 
the concept. This is shown in the diagram 500 of FIG. 5. The concept 1 02 has a 
number of popularity points 502. Each key phrase 1 04a, 1 04b, . . ., 1 04n contributes 
a number of popularity points 502a, 502b, . . ., 502n, respectively, to the number of 
popularity points 502 of the concept itself. Adding the popularity points 502a, 502b, . 
. ., 502n together yields the number of popularity points 502 of the concept 1 02 
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itself. 

[0046] Referring back to FIG. 3, two other parts of the refinement stage 202 may be 

performed, but are optional. First, a relative popularity measure of each concept may 
be determined (306). This popularity measure reflects the popularity of each concept 
as compared to the other concepts, as the concepts appear in the query log by their 
concept words and key phrases. This popularity may be calculated in many different 
ways. For example, the popularity measure may be measured on a scale from zero to 
five, where zero means the concept is least popular, and five means the concept is 
most popular. In such a case, the popularity measure of a concept can be determined 
in this implementation as five times the log of the popularity points attributed to the 
concept divided by the log of the popularity points attributed to the most popular 
concept. 

[0047] Second, a preferred search query for each concept may be determined (308). This 
is the search query that is most likely to result in useful search results for a concept. 
The preferred search query may be determined for a concept by selecting a key phrase 
that is unique to the concept, and which has the greatest popularity as compared to 
any other unique key phrase for the concept and the concept words of the concept. 
The popularity of a key phrase is indicated by the number of popularity points added 
to the concept as a result of the key phrase matching a query phrase within the query 
log. For example, if for the concept Britney Spears the. key phrase Britney is unique to 
the concept and has more popularity points than the concept words Britney Spears 
and the other unique key phrases do, then the key phrase Britney \s selected as the 
preferred search query. If no unique keyphrase is identified by this method, editorial 
means may be used to add such a keyphrase. 

[0048] Imaging stage 

[0049] FIG, 6 is a flowchart showing the imaging stage 204 of FIG. 2 in more detail. First, 
the output of the refinement stage is indexed (600). The output may also then be 
compressed. The output of the refinement stage is the list of concepts and their 
determined popularity, as well as their key phrases and associated data entities. This 
output is indexed and optionally compressed in such a way that it is compatible with 
the data image or database that is used by the search engine at query time, to run a 
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search query entered by the user. The resulting indexed and optionally compressed 
output is then added to the database or data image used by the search engine at 
query time (602). The database or data image may also include other ways to find 
relevant data entities for search queries. For example, the database or data image may 
include the data for the techniques used by current search engines as described in the 
background section. 

[0050] Query stage 

[0051] FIG. 7 is a flowchart showing the query stage 206 of FIG. 2 in more detail. First, a 
query is received (700). For example, a user may enter a search query, such that the 
user wishes to receive relevant data entities, such as web page links, for the search 
query. The received query is next matched against words in the concept title and/or 
associated key phrases of concepts, to determine concepts that match the query 
(702). As mentioned earlier, a concept is said to match a query if the query matches a 
keyphrase and/or one or more words in the titles. Note that the keyphrases and the 
title words may be matched in different ways to the query an exact string match on 
the key phrases may be used, while each of the words in the query may be required to 
be contained (as a whole word) in the title without regard to order. For example, if the 
search query is the word tiger , the query may match the key phrase tiger tor the 
concept Tiger Woods , as well as the key phrase tiger for the concept wild tiger . The 
matching concepts are sorted by and in descending order of their popularity (704). 
Their popularity is measured by the number of popularity points that were added to 
each concept. The sorted matching concepts are then output (706). If there are a large 
number of matching concepts, only a predetermined number, such as the first two or 
three, may be output, or alternatively all the concepts may be output. The output may 
also include the relative popularity of each of the matching concepts, and/or the 
preferred search query for each of the matching concepts. 

[0052] 

FIG. 8 is a flowchart of a method 800 showing how the output of the query stage 
206 of FIG. 2 can be presented to the user, where the associated data entities of the 
concepts are web page links. First, the matching concepts and their relative popularity 
are displayed (802). Where the relative popularity is a number from zero to five, this 
may be indicated to the user by displaying an equal number of icons, such as shaded 
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stars. One or more featured web sites may optionally next be shown (804). 

[0053] One or more web sites, that is, web page links, associated with one or more of the 
matching concepts may also optionally be displayed (806). For example, there may be 
ten places reserved for showing such web page links. An equal number of associated 
web page links may be shown from each matching concept, or the places may be 
divided proportionally among the matching concepts based on their popularity. 
Finally, one or more web pages returned from other types of search engines may be 
optionally displayed (808). Such web pages may have been returned by current search 
engines as have been described in the background section. 

[0054] FIG. 9 is a diagram of a window 900 that shows an example of the output 

resulting from the method 800 of FIG. 8. The user's search query is shown in the box 

_ 901 . The matching concepts and their relative popularities that are displayed by 

yg performing 802 are shown in section 902 of the window 900. The featured web site or 

r; sites that are displayed by performing 804 are shown in section 904. The web sites, 

fU that is, web page links, associated with the matching concepts that are displayed by 

1= performing 806 are shown in section 906 of the window 900. Finally, any other 

^ relevant web pages that are displayed by performing 808 are shown in section 908. 

%i [0055] If one of the matching concepts displayed in section 902 is selected by the user, 
^ another window may appear, as shown as the window 1 000 of FIG. 1 0. The preferred 

O search query for the selected concept is shown in the box 1 002. The associated web 

sites, or web page links, for the selected matching concept are then displayed in 
section 1 006 of the window 1 000. FIGs. 8, 9, and 1 0 only depict an example of the 
display of associated data entities of matching concepts that can be used in 
accordance with the invention. Other manners by which associated data entities of 
matching concepts can be displayed are also contemplated by the invention. 

[0056] System and device implementation 

[0057] 

FIG. 1 1 is a diagram of a system 1 1 00 that can implement the invention as has 
been described. The concepts, key phrases, and associated data entities . 1 102, as well 
as the query log 1 1 04, are used by the popularity tool 1 1 06 to perform the refinement 
stage 202 of FIG. 2 as has been described. The output of the popularity tool is used 
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by the imaging tool 1 1 08 to perform the imaging stage 204 of FIG. 2 as has been 
described. The imaging tool 1 1 08 results in a data image or a database that can be 
integrated into the search engine database 1112, along with other search engine data 
1110. 

[0058] The query run time tool 1114 runs a search against the search engine database 
1 1 1 2 for the search query 1 1 1 6, as has been described as the query stage 206. The 
output of the query run time tool 1114 includes query results 1118 based on the 
query 1116. The query 1116 can also be added to the query log 1 1 04, for future 
periodic use of the popularity tool 1 1 06, and so on. Each of the tools 1 1 06, 1 1 08, and 
1114 may be a separate computer or computerized device, a separate computer 
program, or part of the same computer program. 

[0059] FIG. 1 2 is a diagram of an environment 1 200 in which the invention can be used. 
A client 1 202 and a server 1 206 are both communicatively connected to the Internet 
1 204. The server 1 206 has access to the search engine database 1 1 1 2. A user enters 
l J? a search query on the client 1 202, which is sent to the server 1 206 over the Internet 

45 1 204. The server 1 206 performs the query stage 202 of FIG. 2 as has been described, 

matching the search query received from the client 1 202 against the database 1112. 
O The results are then returned back to the client 1 202 over the Internet 1 204, where 

U they can be displayed for the benefit of the user. 

^ [0060] FIG. 1 3 illustrates an example of a suitable computing system environment 1 0 on 
which the invention may be implemented. For example, the environment 1 0 may 
implement the client 1 202, the server 1 206, and/or the system 1 1 00 that have been 
described. The computing system environment 1 0 is only one example of a suitable 
computing environment and is not intended to suggest any limitation as to the scope 
of use or functionality of the invention. Neither should the computing environment 1 0 
be interpreted as having any dependency or requirement relating to any one or 
combination of components illustrated in the exemplary operating environment 10. In 
particular, the environment 1 0 is an example of a computerized device that can 
implement the servers, clients, or other nodes that have been described. 

[0061] invention is operational with numerous other general purpose or special 

purpose computing system environments or configurations. Examples of well known 
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computing systems, environments, and/or configurations that may be suitable for use 
with the invention include, but are not limited to, personal computers, server 
computers, handor laptop devices, multiprocessor systems, microprocessorsystems. 
Additional examples include set top boxes, programmable consumer electronics, 
network PCs, minicomputers, cell phones, mainframe computers, distributed 
computing environments that include any of the above systems or devices, and the 
like. 

[0062] The invention may be described in the general context of computerinstructions, 
such as program modules, being executed by a computer. Generally, program 
modules include routines, programs, objects, components, data structures, etc. that 
perform particular tasks or implement particular abstract data types. The invention 
may also be practiced in distributed computing environments where tasks are 
O performed by remote processing devices that are linked through a communications 

i.rj 

m network. In. a distributed computing environment, program modules may be located in 

jr? both local and remote computer storage media including memory storage devices. 

£ [0063] An example of a system for implementing the invention includes a computing 
device, such as computing device 10. In its most basic configuration, computing 

□ device 1 0 typically includes at least one processing unit 1 2 and memory 1 4. 

lS Depending on the exact configuration and type of computing device, memory 1 4 may 

Ljj be volatile (such as RAM), non-volatile (such as ROM, flash memory, etc.) or some 

combination of the two. This most basic configuration is illustrated by dashed line 1 6. 
Additionally, device 10 may also have additional features/functionality. For example, 
device 1 0 may also include additional storage (removable and/or non-removable) 
including, but not limited to, magnetic or optical disks or tape. Such additional 
storage is illustrated in by removable storage 18 and non-removable storage 20. 

[0064] 

Computer storage media includes volatile, nonvolatile, removable, and non- 
removable media implemented in any method or technology for storage of 
information such as computer readable instructions, data structures, program 
modules, or other data. Memory 14, removable storage 18, and non-removable 
storage 20 are all examples of computer storage media. Computer storage media 
includes, but is not limited to, RAM, ROM, EEPROM, flash memory or other memory 
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technology, CDROM, digital versatile disks (DVD) or other optical storage, magnetic 
cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or 
any other medium which can be used to store the desired information and which can 
accessed by device 10. Any such computer storage media may be part of device 10. 

[0065] Device 1 0 may also contain communications connection(s) 22 that allow the 

device to communicate with other devices. Communications connection(s) 22 is an 
example of communication media. Communication media typically embodies 
computer readable instructions, data structures, program modules, or other data in a 
modulated data signal such as a carrier wave or other transport mechanism and 
includes any information delivery media. The term "modulated data signal" means a 
signal that has one or more of its characteristics set or changed in such a manner as 
to encode information in the signal. Byway of example, and not limitation, 
communication media includes wired media such as a wired network or direct-wired 
connection, and wireless media such as acoustic, RF, infrared and other wireless 
media. The term computer readable media as used herein includes both storage 
media and communication media. 

[0066] Device 1 0 may also have input device(s) 24 such as a keyboard, mouse, pen, voice 
input device, touch input device, etc. Output device(s) 26 such as a display, speakers, 
printer, etc. may also be included. All these devices are well known in the art and need 
not be discussed at length here. 

[0067] The methods that have been described can be computer-implemented on the 
device 10. A computer-implemented method is desirably realized at least in part as 
one or more programs running on a computer. The programs can be executed from a 
computer-readable medium such as a memory by a processor of a computer. The 
programs are desirably storable on a machine-readable medium, such as a floppy disk 
or a CD-ROM, for distribution and installation and execution on another computer. 
The program or programs can be a part of a computer system, a computer, or a 
computerized device. 

[0068] Conclusion 

[0069] 

It is noted that, although specific embodiments have been illustrated and 
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described herein, it will be appreciated by those of ordinary skill in the art that any 
arrangement that is calculated to achieve the same purpose may be substituted for 
the specific embodiments shown. This application is intended to cover any 
adaptations or variations of the present invention. Therefore, it is manifestly intended 
that this invention be limited only by the claims and equivalents thereof. 
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